Concept based clustering for descriptive document classification

Similar resources

Clustering Techniques for Document Classification

This paper studies existing classification and information retrieval techniques in order to use an algorithm that will group a set of documents. Knowledge discovery in texts is therefore selected as the methodology to follow, and the steps are explained in order to reach unsupervised document classification. After conducting an experiment with three ...

Model Based Document Classification and Clustering

In this paper we develop a complete methodology for document classification and clustering. We start by investigating how the choice of document features, such as weights, transformations, and dimensionality reduction, influences the performance of document classification. We then use these findings to construct a model-based document clustering (MBDC) algorithm suitable for document collectio...

Concept-based Mining Model for Web Document Clustering

Most document clustering techniques are based on statistical analysis of a term, either a word or a phrase. Statistical analysis of term frequency captures the importance of the term within the document only. Thus, the underlying mining model should indicate terms that capture the semantics of the text. In this case, the mining model can capture terms that present the concepts of the ...

Feature Reduction for Document Clustering and Classification

Often users receive search results which contain a wide range of documents, only some of which are relevant to their information needs. To address this problem, ever more systems not only locate information for users, but also organise that information on their behalf. We look at two main automatic approaches to information organisation: interactive clustering of search results and pre-categori...

Distributed Document and Phrase Co-embeddings for Descriptive Clustering

Descriptive document clustering aims to automatically discover groups of semantically related documents and to assign a meaningful label to characterise the content of each cluster. In this paper, we present a descriptive clustering approach that employs a distributed representation model, namely the paragraph vector model, to capture semantic similarities between documents and phrases. The pro...

Journal

Journal title: Data Science Journal

Year: 2007

ISSN: 1683-1470

DOI: 10.2481/dsj.6.91